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ABSTRACT 

The spatial, kinematic, and elemental-abundance structure of the Milky Way's stellar disk is com- 
plex, and has been difficult to dissect with local spectroscopic or global photometric data. Here, 
we develop and apply a rigorous density modeling approach for Galactic spectroscopic surveys that 
enables investigation of the global spatial structure of stellar sub-populations in narrow bins of [a/Fe] 
and [Fe/H], using 23,767 G-type dwarfs from SDSS / SEGUE, which effectively sample 5 < R GC < 12 
kpc and 0.3 < \Z\ < 3 kpc. Wc fit models for the number density of each such ([a/Fe] & [Fe/H]) 
mono- abundance component, properly accounting for the complex spectroscopic SEGUE sampling of 
the underlying stellar population, as well as for the metallicity and color distributions of the samples. 
We find that each mono-abundance sub-population has a simple spatial structure that can be de- 
scribed by a single exponential in both the vertical and radial direction, with continuously increasing 
scale heights («200 pc to 1 kpc) and decreasing scale lengths (>4.5 kpc to 2 kpc) for increasingly 
older sub-populations, as indicated by their lower metallicities and [a/Fe] enhancements. That the 
abundance-selected sub-components with the largest scale heights have the shortest scale lengths is 
in sharp contrast with purely geometric 'thick-thin disk' decompositions. To the extent that [a/Fe] 
is an adequate proxy for age, our results directly show that older disk sub-populations are more cen- 
trally concentrated, which implies inside-out formation of galactic disks. The fact that the largest 
scale-height sub-components are most centrally concentrated in the Milky Way is an almost inevitable 
consequence of explaining the vertical structure of the disk through internal evolution. Whether the 
simple spatial structure of the mono-abundance sub-components, and the striking correlations be- 
tween age, scale length, and scale height can be plausibly explained by satellite accretion or other 
external heating remains to be seen. 
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1. INTRODUCTION 

The formation of galactic disks is a long-standing 
problem in galaxy formation. In numerical simulations, 
disks form through gas dissipation (Sandage et al. 1970; 
Larson 1976), and the formation of the outer regions of 
the disk happens on longer time scales than the inner 
disk (Gott & Thuan 1976; Larson 1976; Katz & Gunn 
1991). The disks that form have exponential density 
profiles (Lake & Carlberg 1988), possibly due to the de- 
tailed conservation of angular momentum of an initially 
spherical cloud in solid-body rotation (Fall & Efstathiou 
1980: Gunn 1982). Yet, direct observational evidence 
for this picture is scant (e.g., Dalcanton et al. 1997; 
Somerville et al. 2008; Wang et al. 2011), and forming 
realistic disks within the ACDM paradigm remains chal- 
lenging (e.g., Abadi et al. 2003: Scannapieco et al. 2009; 
Guedes et al. 2011). 
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Central to the question as to how galactic disks 
form and evolve is the existence of "thick" disk 
components. First discovered in external galaxies 
(Tsikoudi 1979; Burstein 1979; van der Kruit & Searle 
1981), thick-disk components represent excess light or 
stars beyond the canonical thin disk's exponential ver- 
tical profile. The Milky Way's thick-disk component 
(Yoshii 1982; Gilmore & Reid 1983; Reid & Majewski 
1993; Majewski 1993; Juric et al. 2008) provides us 
with a detailed look at this common galactic compo- 
nent. Generally, thick-disk components are found to 
be old (Bensby et al. 2005; Yoachim & Dalcanton 2008), 
kinematically hot (Chiba & Beers 2000; Soubiran et al. 
2003; Gilmore et al. 2002; Yoachim & Dalcanton 2005), 
and metal-poor (compared to the thin-disk compo- 
nents), as well as enhanced in a-elements (Fuhrmann 
1998; Prochaska et al. 2000; Tautvaisiene et al. 2001; 
Bensby et al. 2003; Feltzing et al. 2003; Mishenina et al. 
2004; Bensby et al. 2005; Reddy et al. 2006; Haywood 
2008). Density decompositions of the stellar disk 
into thinner and thicker components of external galax- 
ies and the Milky Way have found that thicker- 
disk components have larger scale heights and longer 
scale lengths than their corresponding thin disk 
(Robin et al. 1996; Buser et al. 1999; Chen et al. 2001; 
Ojha 2001; Neeser et al. 2002; Larsen & Humphreys 
2003; Yoachim & Dalcanton 2006; Pohlen et al. 2007; 
Juric et al. 2008). However, these decompositions arc 
purely geometric, and do not take kinematics or abun- 
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dance information into account when assigning thinncr- 
or thicker-disk membership. 

A number of qualitatively very different models have 
been proposed for the formation of thick-disk compo- 
nents. External mechanisms, such as the direct accretion 
of stars from a disrupted satellite galaxy (Abadi et al. 

2003) or the heating of a pre-existing thin disk 
through a minor merger (Quinn et al. 1993; Wyse et al. 
2006; Kazantzidis et al. 2008; Villalobos & Helmi 2008; 
Moster et al. 2010), can explain many of the observed 
properties of thick-disk components. Thick-disk com- 
ponents can also be formed internally through star for- 
mation following a gas- rich merger (e.g., Brook et al. 

2004) or by quiescent internal dynamical evolution 
(Schonrich & Binney 2009a,b; Loebman et al. 2011). 

The idea that the thick-disk component could arise 
in good part through internal evolution is an intrigu- 
ing possibility, as it explains a range of other ob- 
servations. Significant redistribution of angular mo- 
mentum without radially heating the disk ("radial 
migration") happens naturally if spiral structure is 
transient (Sellwood & Binney 2002; Roskar et al. 2008, 
2011), and has also been shown to occur through 
bar-spiral structure interactions (Minchev & Famaey 
2010; Minchev et al. 2011). It can also be induced 
by an orbiting satellite (Quillen et al. 2009; Bird et al. 
2011). The transient nature of spiral structure is fa- 
vored both theoretically (e.g., Sellwood & Carlberg 1984; 
Carlberg k Sellwood 1985; Sellwood & Lin 1989) and 
observationally from surveys of the Solar neighbor- 
hood (e.g., Dehnen 1998; De Simone, Wu, & Tremaine 
2004; Bovy et al. 2009a; Bovy & Hogg 2010; Sellwood 
2011) and of external galaxies (e.g., Meidt et al. 2009; 
Foyle et al. 2011). Radial migration naturally explains 
the flatness and spread in the age-metallicity relation in 
the Solar neighborhood, as the large-scale changes in the 
guiding radii of stars tend to flatten radial-abundance 
gradients. A thicker-disk component arises through ra- 
dial migration when stars from the inner Galaxy migrate 
outward, where the gravitational attraction toward the 
mid-plane is smaller, such that they reach larger heights 
above the plane. However, to date, radial-migration 
models have essentially only been confronted with data at 
the Solar radius, and observational tests to discriminate 
formation scenarios for thicker-disk components have not 
been conclusive (e.g., Dierickx et al. 2010). 

Because radial migration is effectively a diffusion pro- 
cess, it complicates, if not erases, the link between the 
present-day chemo-orbital distribution and the orbital 
characteristics and abundance distribution at the time 
of a given stars' birth. Without detailed modeling of 
the episodes of transient spiral structure (or the equiv- 
alent in other radial- migration scenarios), reconstruct- 
ing the radial and azimuthal actions is problematic as 
well. However, the vertical action is an adiabatic invari- 
ant during the slow change in the vertical potential that 
ensues from this migration. The overall (mass-weighted) 
radial structure of the disk is left relatively unchanged 
as radial migration proceeds (Sellwood & Binney 2002; 
Minchev et al. 2011) — essentially because any redistri- 
bution of the surface-mass density would provide energy 
to heat the disk, and to avoid this heating the overall 
surface-mass density profile needs to be conserved — but 
if different (age- or abundance-) components of the disk 



have a different initial structure, radial migration will 
work to bring the spatial distribution of different popu- 
lations closer to the mean. 

In this paper we implement for the first time an alter- 
native approach to globally 'dissecting' the Milky Way's 
stellar disk: we study the overall (vertical and radial) 
spatial structure of large samples of stars selected to 
be sub-populations in the elemental-abundance space 
spanned by metallicity [Fc/H] and a- abundance [a/Fe] 8 , 
as it is becoming increasingly clear that a character- 
ization of the thicker disk components based only on 
stellar abundances is superior to kinematic definitions 
(Navarro et al. 2011; Lee et al. 2011b). The [a/Fe] ra- 
tio in particular is a crucial parameter, as it can be 
used as a relative age indicator (Wyse & Gilmore 1988). 
At early times, the low-metallicity interstellar medium 
is enriched by type II supernovae (SNcII). After about 
2 to 3 Gyr, type la SNe occur (e.g. Maoz et al. 2011), 
and the stellar yields shift toward Fe, leading to a de- 
creasing [a/Fe] with increasing age. Therefore, popu- 
lations of stars with enhanced [a/Fe] ratios are chem- 
ically older than those with [a/Fe] closer to the solar 
ratio. By using the SDSS / SEGUE G-dwarf sample, we 
observe stars globally across the Milky Way, constraining 
their vertical distributions from 300 pc to 4 kpc from the 
mid-plane, and their radial densities from Galactocentric 
radii ranging from 5 to 12 kpc. We show that the scale 
length of the a-enhanced — and thus probably oldest — 
population is much shorter than that of the chemically 
more-evolved stars with solar [a/Fe]. This is opposite 
to previous disk decompositions into thicker and thin- 
ner components that make use of geometric information 
alone (e.g., Juric et al. 2008, and see above). Also, we do 
not detect any discontinuity in the vertical scale- height 
as a function of [a/Fe] that might be expected if the 
thick-disk component was formed through a singular ex- 
ternal or internal event, but instead observe a continuous 
increase in scale- height with [a/Fc]. This casts doubt on 
how sensible or useful it is to think of distinct thin- and 
thick-disk components in the Milky Way. 

The outline of this paper is as follows. In § 2 we present 
the details of our data sample. Our density-fit method- 
ology, accounting for the various aspects of the SEGUE 
selection function, is given in § 3. We give the results of 
the density fits to the various abundance-selected sam- 
ples in § 4, and discuss these results in terms of disk for- 
mation and evolution models in § 5. We summarize the 
main conclusions of the paper in § 6. The appendices 
describe our model for the SEGUE selection function, 
some details of our fitting methodology, and detailed 
comparisons between our fits and the data. Modeling 
the spectroscopic SEGUE selection function is central to 
our analysis. It is described in an appendix to aid the 
readability of the paper, as its implementation requires a 
detailed and hence extensive description that may not be 
of interest to all readers. Throughout this paper, we as- 
sume that the Sun's displacement from the mid-plane is 
25 pc toward the North Galactic Pole (Chen et al. 2001; 
Juric et al. 2008), and that the Sun is located at 8 kpc 
from the Galactic center (e.g., Bovy et al. 2009b). 



8 The [o/Fe] ratio in this paper is an average of the [Mg/Fe], 
[Si/Fe], [Ca/Fe], and [Ti/Fe] ratios (Lee et al. 2011a). 
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2. DATA 

The Sloan Digital Sky Survey (SDSS; York et al. 2000) 
has obtained u,g,r,i and z CCD imaging of ps 10 4 deg 2 
of the northern and southern Galactic sky (Gunn et al. 
1998; Stoughton et al. 2002; Gunn et al. 2006). All 
the data processing, including astromctry (Pier et al. 
2003) , source identification, dcblcnding, and photometry 
(Lupton et al. 2001), calibration (Fukugita et al. 1996; 
Hogg et al. 2001; Smith et al. 2002; Ivezic et al. 2004; 
Padmanabhan et al. 2008) , and spectroscopic fiber place- 
ment (Blanton et al. 2003), are performed with auto- 
mated SDSS software. The SDSS spectroscopic sur- 
vey uses two fiber-fed spectrographs that have 320 fibers 
each. 

The Sloan Extension for Galactic Understanding and 
Exploration {SEGUE '; Yanny et al. 2009) is a low- 
resolution (R ps 2,000) spectroscopic sub-survey of the 
SDSS focused on Galactic science. We select a sam- 
ple of G-type dwarfs from the SDSS /SEGUE Data Re- 
lease 7 (DR7; Abazajian et al. 2009). G-type dwarfs 
are the most luminous tracers whose main-sequence life- 
time is larger than the expected disk age at basically 
all metallicities. G-type stars are selected from the full 
DR7 SEGUE sample using a simple color-magnitude-cut 
that corresponds to the SEGUE G-star target type: 0.48 
< .9 — r < 0.55 and r < 20.2. All magnitudes here and in 
what follows are absorption-corrected and dereddened, 
respectively, using the reddening maps of Schlegel et al. 
(1998); as we only use lines of sight with relatively small 
extinction and we do not use the SDSS u band, using 
the improved reddening maps of Schlafly & Finkbeiner 
(2011) leads to insignificant differences for the purpose of 
our analysis. We further limit the spectroscopic sample 
to those lines of sight with E(B — V) < 0.3, to minimize 
effects due to uncertainty in extinction, to objects having 
spectra with signal-to- noise ratio SN > 15, and to objects 
with valid metallicities, heliocentric linc-of-sight veloci- 
ties, and proper motions (even though the latter two are 
not used in the analysis). All of the selected objects 
have valid values for their stellar atmospheric param- 
eters as determined by the SEGUE Stellar Parameter 
Pipeline (Lee et al. 2008a,b; Allende Prieto et al. 2008; 
Lee et al. 2011a; Smolinski et al. 2011). Typical uncer- 
tainties in these parameters are 0.2 dex for the spectro- 
scopic mctallicity [Fc/H], 0.1 dex for [a/Fc], 0.25 dex for 
the surface gravity log g, and 180 K for the effective tem- 
perature (Schlesinger et al. 2010; Smolinski et al. 2011). 
In what follows we are primarily interested in the rela- 
tive rankings of stars based on [a/Fc] and [Fc/H], such 
that random uncertainties are all that matter. Note that 
our signal-to- noise-ratio cut of SN > 15 is more inclu- 
sive than recommended by Lee et al. (2011a) (who rec- 
ommend SN > 20), but this does not increase the uncer- 
tainties in [a/Fe] by much. For dwarfs with [Fe/H] > — 2, 
there is no significant correlation between the [Fc/H] and 
[a/Fe] estimates. We use this sample of G-type stars to 
determine the SEGUE G-star selection function in Ap- 
pendix A below. 

We select G-type dwarfs by selecting stars with log 
g > 4.2, to eliminate giant stars (we have verified that 
more conservative log g cuts give the same results, see 
Appendix C). We perform no other cuts (e.g., other color 
cuts or distance cuts) beyond these basic cuts in order 
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Fig. 1. — Comparison between distance moduli derived from the 
Ivezic et al. (2008) photometric distance relation (their eqn. A7), 
and those derived from the An et al. (2009) theoretical isochrones. 



to preserve a relatively simple spatial selection function. 
This sample contains about 28,000 stars, 23,767 of which 
lie within the well-populated bins in the ([Fe/H], [a/Fe]) 
plane that we analyze below. Distances to individual 
stars are obtained from the Ivezic et al. (2008) photomet- 
ric color-metallicity-absolute-magnitude relation (their 
cqn. A7) applied to the g — r color, rather than the 
g — i color, using 
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and employing the spectroscopic metallicity. These dis- 
tances are about 10 percent larger than the distances ob- 
tained from the An et al. (2009) stellar isochrones, with 
little to no color or metallicity dependence over the color 
and metallicity ranges considered here (see FIG. 1 and 
further discussion in § 5). Individual distance uncertain- 
ties are typically < 10 percent, and thus do not greatly 
smooth the underlying Galactic density, whose scales 
are much larger than this (for an illustration of this see 
Juric et al. 2008, where much larger distance uncertain- 
ties of around 20 percent were shown to influence the 
inferred scale heights by less than 5 percent). 

The distribution of the G-dwarf sample in the elemen- 
tal abundance space, made up of [Fc/H] and [a/Fe], is 
shown in FIG. 2. This distribution is characterized by 
two modes, one a metal-poor, a-enhanced population 
that must represent the oldest part of the Galactic disk, 
and another that is metal-rich and has a solar [a/Fe] ra- 
tio. The two boxes delineated by dashed lines constitute 
our broad separation of these two populations, which we 
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[Fe/H] 

Fig. 2. — Distribution of the spectroscopic sample of G dwarfs in 
elemental-abundance space. The density is linear, and the contours 
contain 68, 95, and 99 percent of the distribution. Outliers beyond 
99 percent are individually shown. Our cuts to select a-old (top, 
left) and a-young (bottom, right) samples are shown as dashed 
boxes. The dotted lines indicate the median [Fe/H] for the a-old 
sample (rounded to the nearest 0.05 dex), used to split the a-old 
sample in [Fc/H], and for [a/Fc] < 0.25 the dotted box indicates 
the metal-poor o-young sample, used in §§ 4.1 and 4.2. 

will refer to as a-old and a-yovmg, respectively 

a-old sample : 

-1.5 < [Fc/H] < -0.25,0.25 < [a/Fe] < 0.50, (2) 

a-young sample : 

-0.3 < [Fc/H] < 0.25,0.00 < [a/Fe] < 0.25. (3) 

The spatial distributions of the a-old and a-young G- 
dwarf samples are shown in FIG. 3, without accounting 
for the selection function. It is clear from this figure 
that the bright limit of the G-dwarf sample (r > 14.5, 
see below) is such that the effective minimum distance 
is approximately 600 pc. This means that for the thin- 
ner disk components, stars within one scale height are 
not sampled by SDSS / SEGUE (this is also apparent in 
FIG. 2, where most stars have sub-solar metallicities). 
However, because the thinner components also contain 
the most stars (see below and Bovy et al. 2012a), there 
are still sufficient stars above one scale height of these 
components such that the G-dwarf data set contains a 
large number of them. 

Understanding and modeling the SEGUE selection 
function, i.e., the relation between the stars with suc- 
cessfully determined spectral parameters that enter our 
sample, and their photometric or volume-complete par- 
ent population, is central to any analysis that involves the 
spatial structure of spectroscopically-sclccted samples. It 
has not been worked out previously, and while in princi- 
ple straightforward, it requires attention to a number of 
details. We describe our model for the SEGUE selection 
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Fig. 3. — Spatial distribution of the spectroscopic sample of a- 
old (top panel) and a-young (bottom panel) G dwarfs in the R, Z 
plane. 

function in Appendix A. The SEGUE G-star sample was 
obtained by uniformly sampling the dereddened color- 
magnitude boxes with color range 0.48 < g — r < 0.55 
and a "bright" (14.5 < r < 17.8) and "faint" (17.8 
< r < 20.2) apparent magnitude range along a set of 
sw 150 lines of sight. Due to our signal-to-noise ratio 
cut, this uniform sampling is truncated at a brighter 
magnitude, where the cut-off is different for each line of 
sight. We determine the cut-off for each SEGUE plug- 
plate (which we refer to simply as "plates" in what fol- 
lows) as the faintest star in the color-magnitude box, and 
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model the r-dependence of the selection function using 
a hyperbolic-tangent step around the cut-off. We obtain 
the overall selection fraction for each line of sight by com- 
paring the size of the spectroscopic sample to that of the 
photometric sample in the targeted color-magnitude box 
for each individual line of sight. This model is described 
in more detail in Appendix A. 

3. DENSITY FITTING METHODOLOGY 
3.1. Generalities 

Fitting the spatial-density profiles of various G-dwarf 
sub-samples must account for the fact that the observed 
star counts do not reflect the underlying stellar distribu- 
tion, but are strongly shaped by (a) the strongly position- 
dependent selection fraction of stars with spectra (see 
FIG. 11), (b) the need to use photometric distances that 
in turn depend on the color and metallicity distribution 
of the sample (as the magnitude-limited SEGUE sam- 
ple corresponds to a color- and metallicity-dcpcndcnt 
distance- limited sample), and (c) the pencil-beam na- 
ture of the SEGUE survey. To properly take all of these 
effects into account, we need to use forward modeling: 
in what follows we fit stellar-density models to the data 
by generating the expected observed distribution of stars 
in the spectroscopic sample, based on our model for the 
SEGUE selection function and the photometric distance 
relation; this predicted distribution is then compared to 
the observed star counts. We show below how this can be 
expressed as a maximum likelihood problem. This gen- 
eral density-fitting methodology applies to any spectro- 
scopic survey, with minor modifications, and needs to be 
applied to obtain selection-corrected distributions from 
spectroscopically selected stellar samples. In particular, 
this methodology needs to be applied to constrain the 
structural parameters of abundance-selected samples in 
the Milky Way. 

As the photometric distance estimates depend on the 
g — r color, metallicity [Fe/H], and apparent r-band mag- 
nitude, and because the selection function is a function 
of position, r, and g — r, we need to model the observed 
density of stars in color-magnitudc-mctallicity-position 
space, X(l, b, d,r,g — r, [Fc/H]). This density of stars can 
be written as 

X(l,b,d,r,g-r, [Fe/H]) = 

p(r,g-r,{Fe/Ji\\R,Z,<t>)xv*(R,Z,ct>) (4) 

x \J(R, Z, 4>; I, b, d)\ x S*(plate,r, g - r) . 

Here, (R, Z, (f>) are Galactocentric cylindrical coordi- 
nates corresponding to rectangular coordinates (X, Y, Z) , 
which can be calculated from (I, b, d). The factor p(r, g — 
r, [Fe/H] \R, Z, </>) is the number density in magnitudc- 
color-metallicity space as a function of position (see fur- 
ther discussion in Appendix B). The | J(R, z; I, b, d)\ is a 
Jacobian term because of the (X, Y, Z) — > (I, b, d) coordi- 
nate transformation; the crucial factor S^plate, r, g — r) is 
the selection function as given in equation (A2). Finally, 
(i?, Z, <p) is the underlying spatial number density of 
the sample; we stress that this is a density as a func- 
tion of rectangular coordinates (X, Y, Z) that we evalu- 
ate through (i?, </>) (as we will assume later that the 
density is axisymmetric) , i.e., its dimension is 1 / (spatial 
unit) 3 . In what follows we will assume that our models 



for this density (e.g., exponentials in the vertical and ra- 
dial direction) are characterized by a set of parameters 
denoted as 9 and that the density is axisymmetric, such 
that i/» = v*(R, Z\6). 

The likelihood of a given model for the density 
f*(i?, z\9) is given by that of a Poisson process with rate 
parameter A 

bxC = J2 I ln A ({^ b > d > r, 9 ~ r, [Fc/H]}^)] 

i 

- J dl dbdddr d(g - r) d[Fc/H] X(l, 6, d, r, g - r, [Fe/H] \6) , 

(5) 

where the integral is over the domain surveyed and i in- 
dexes the observed objects. Because the Jacobian, the 
selection function, and the density in magnitude-color- 
mctallicity space only enter A multiplicatively (equa- 
tion (4)) their contribution to the first term (In A) in 
equation (5) is a constant that does not depend on the 
density parameters. Thus, up to a term that does not 
depend on 9, the log likelihood is equivalent to 

ln£ = ]r[ln^(i?,z|{Z,&, d} 4 , 0)] 

i 

-J dldb dd dr d(g - r) d[Fc/H] X(l,b,d,r, g - r, [Fe/H]|0) . 

(6) 

Note that the Jacobian, the density in the magnitudc- 
color-metallicity space, and the selection function only 
enter through the second term, and do not need to be 
evaluated on a star-by-star basis. The second term in 
equation (6) — the normalization integral — can be writ- 
ten as (assuming that the density does not depend on 
(/, b) over the area of a plate, although this can easily be 
relaxed) 

J dldbdddrd(g - r)d[Fc/H] X(l,b,d,r,g- r, [Fe/H]|0) 
= A p J d (-9 - r) d[Fc/H] dr S(p, r, g - r) 

plates p 

J ddp(r, g - r, [Fc/H] \R, Z,4>) d 2 v*(R, z\l, b, d, 9) , 

(7) 

where A p is the area of a SEGUE plate (approximately 
7 dcg 2 ). 

In the following, we analytically marginalize over the 
amplitude of the rate A with a logarithmically flat prior. 
In that case, the log likelihood becomes 

ln£ = 

i 

-In J dldbdddrd(g-r)d{Fe/K\ X(l,b,d,r,g - r, [Fe/H]|0) 

(8) 

Note that the normalization integral is now moved inside 
of the logarithm. 
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In Appendix B, wc discuss how wc in- 
clude the magnitudc-color-metallicity factor 
p(r, g-r, [Fe/H] \R, Z, <f) in the likelihood. 

3.2. Stellar number density models 

We fit number-density models for the various abun- 
dance sub-populations, consisting of a disk with an ex- 
ponential profile in both the vertical and radial direction, 
plus a constant density 

v*{R,Z) = 

"1 / R-Ro' 

2X eXP ' 



N(Ro) 



\Z\ 
h, 



ex P -~T~ + 7T7 



24 



(9) 



where N(Rq) is the vertically- integrated number den- 
sity at Rq. We refer to this model below as a single- 
exponential disk fit, as in all cases the data imply /3 C <C 1. 
We also fit combinations of exponential disks as 

v*(R, Z) = 
N(Ro) 

ft 



2h z 



cxp 



R - Re 

h R 



2h 



z.2 



exp 



R — Ri 



h 



R,2 



exp 



exp 



\z\ 

h z 



(10) 



In particular in the Z direction, this is analogous to tra- 
ditional density fits based on photometric data, which 
require (at least) two exponential components. We do 
not fit for the overall normalization, N(Rq), as we are 
interested primarily in the shape of the stellar-density 
profile. 

To determine the best-fit parameters and their un- 
certainties we use Powell's method for minimization 
(Press et al. 2007), and then MCMC-sample the poste- 
rior distribution function, obtained by multiplying the 
likelihood in equation (8) with fiat logarithmic priors for 
the scale parameters (h z , hn, h z _2, hR.,2) and flat priors 
on the contamination- fraction parameters (/3 C , ^2), using 
an ensemble MCMC sampler (Goodman & Weare 2010; 
Forcman-Mackcy et al., 2011, in preparation). 

3.3. Tests on mock data 

In Appendix D, we discuss tests of the fitting method- 
ology on mock data sets made up of single-exponential 
disk components observed using the SEGUE sampling. 
These tests show that we can recover the input density 
structure to within the MCMC-dctermined uncertainties 
over the range of inferred scale heights, scale lengths, and 
sample sizes found below. 

4. DENSITY STRUCTURE 

First, we briefly discuss the result of fitting the broad 
bins in abundance as defined in equation (2), in order 
to explore the broad trends in spatial structure with el- 
emental abundance. In § 4.3, we then split the sample 
finely in elemental-abundance space and map the struc- 
ture of mono-abundance populations. 

4.1. The a-old stars 

For the a-old sample, the fit results for single expo- 
nential profiles in R and Z, and for a combination of 



two exponential profiles for both R and Z, are given in 
TABLE 1. The model with two exponentials in both R 
and Z is preferred, but the parameters of the dominant 
double-exponential disk are similar for both fits. That 
is, even when we give the model the additional freedom 
of two vertical scale heights, the data lead us to employ 
only a single exponential scale height. There is no evi- 
dence for a thinner component in the a-old abundance 
range. We see that the a-old sample is dominated by a 
population of stars with a scale height of 686±11 pc, and 
a short scale length of 2.01±0.05 kpc (consistent with the 
rough estimate of 2 kpc based on a handful of stars by 
Bensby et al. 2011 and the indirect dynamical estimate 
of 2.2±0.35 kpc of Carollo et al. 2010). 

We have split the a-old sample into more metal-poor 
and more metal-rich sub-samples by cutting the sample 
at [Fe/H] = -0.7. This is close to the median [Fe/H] of 
the a-old sample. The metal-poor sub-sample may be 
identified with the metal-weak thick disk (MWTD) pop- 
ulation discussed by Carollo et al. (2010), which they ar- 
gue covers the metallicity range —1.8 < [Fe/H] < —0.8. 
The resulting fits for the spatial structure of these sub- 
samples are given in TABLE 1. The inferred scale lengths 
for these sub-samples are equal to within the uncertain- 
ties. However, the scale height of the more metal-poor 
sample is 856±20 pc while that of the more metal-rich 
sample is 583±16 pc. The radial scale length of the 
MWTD determined from the indirect dynamical anal- 
ysis of Carollo et al. (2010) is roughly 2 kpc, while the 
scale height is 1.36±0.13 kpc. 

We have also split the a-old sample into two bins in 
[a/Fe] by splitting the sample at [a/Fe] = 0.35. The 
best-fit density profiles, given at the bottom of TABLE 1, 
again have similar scale lengths, around 2 kpc, and dif- 
ferent scale heights. The stars that are most enhanced 
in a-elemcnts have the largest scale height (765±15 pc) 
and the shortest scale length (1.89±0.04 kpc), while the 
less a-enhanced stars have a smaller scale height (627±18 
pc) and longer scale length (2.23±0.1 kpc). As the lat- 
ter dominate the full a-old sample, their scale height is 
very similar to that inferred for the full sample. We ex- 
plore the dependence of the disk parameters on [Fe/H] 
and [a/Fe] in more detail in § 4.3 below. 

4.2. The a-young sample 

The results for single exponential disk fits and double 
exponential disk fits for the a-young sample are given 
in TABLE 2. The double exponential disk fit model is 
formally preferred, but the parameters of the dominant 
double-exponential disk are again similar for both fits. 
We see that the a-young sample is dominated by a pop- 
ulation of stars with a low scale height of 256±4 pc and 
a long scale length of 3.6±0.2 kpc. 

The second double-exponential disk in the best-fit 
model for the a-young sample has a scale height of 
664±132 pc, which is consistent with the scale-height 
measurement of the a-old sample above. However, the 
fraction of stars in this secondary component is too small 
to constrain its scale length, and is conceivably simply a 
result of 'abundance contamination' of the sample. 

Density fits for a-young samples with the same [a/Fe] 
limits as the nominal a-young sample shown in the top 
panel, but that are more metal-poor, are also given in 
TABLE 2. We do not measure any radial density decline 
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TABLE 1 

Results for the q-old G-dwarf sample ( —1.5 < [Fe/H] < -0.25, 0.25 < [a/Fe] < 0.50) 





/, 

n z 
(,P C J 


n-R 
(kpc) 


"z,2 


(Kpc) 


P2 


Q 

Pc 


all plates 


701±5 


2.06±0.03 








0.0000±0.0009 


bright plates 


769±14 


1.79±0.05 








0.004±0.009 


faint plates 


714±11 


2.25±0.05 








O.OOliO.001 


b < 0° 


694±9 


o 09-1-0 0^ 








0000-1-0 001 


b > 0° 


699±8 


2.10±0.04 








O.OOOiO.OOl 


16] > 45° 


696±6 


2.23±0.06 








0.0000±0.0009 


6 < 45° 


640±10 


o 0^+0 04 








002+0 002 


all plates 


686±11 


2.01±0.05 


933±49 


3.0±0.4 


0.04±0.03 




bright plates 


7o4±20 


1.78±0.04 


312o±271 


>64 


U.01±0.02 




faint plates 


688±40 


2.2±0.1 


1311±189 


>3.0 (5±1) 


0.03±0.04 




b < 0° 


671±22 


1.97±0.08 


993±169 


3.7±0.4 


0.05±0.05 




b > 0° 


687±11 


2.06±0.07 


886±?j>° 


3±1 


0.04±0.04 




|6| > 45° 


692±11 


2.2±0.1 


800±88 


4.3±0.4 


0.01±0.07 




\b\ < 45° 


639±17 


2.03±0.07 


1142±99 


>5 


0.01±0.02 




[Fe/H] < -0.7 


856±20 


2.06±0.08 


865±108 


2.1±0.3 


0.07±0.08 




[Fe/H] > -0.7 


583±16 


1.97±0.08 


873±62 


4.0±0.5 


0.03±0.04 




0.25 < [a/Fc] < 0.35 


627±18 


2.23±0.10 


802±104 


3.5±0.3 


0.03±0.06 




0.35 < [a/Fe] < 0.5 


765±15 


1.89±0.04 


826±45 


2.0±0.1 


0.03±0.06 





Note. — Lower limits are at 99 percent posterior confidence. Lower limits are given when the best-fit 
value is larger than 4.5 kpc. The best-fit value is not given if the best-fit value is larger than 6 kpc. 



for these more metal-poor a-young samples, and short 
scale lengths for these samples are ruled out by the data. 
We consider this further in § 4.3 and in the discussion 
section below. 

When we split the a- young sample into two pieces, 
by cutting at [a/Fe] = 0.15, we find that the more a- 
enhanccd sample has the shortest scale length (2.3±0.2 
kpc) and the largest scale height (348±13 pc). The sam- 
ple with [a/Fe] closer to solar has a longer scale length 
of 4.3±0.2 kpc and a smaller scale height of 239±4 pc. 

4.3. The spatial structure of mono- abundance 
sub-populations 

In the previous two sections we found that sub-samples 
of stars defined by their element abundances appear to 
have a simple spatial structure, approximated by a sin- 
gle exponential in the radial and vertical direction. The 
scale lengths and heights of these sub-sets seem to vary 
systematically with the abundances: the a-old sample 
has a shorter scale length than the a-young sample, and 
if we split those two samples further in [a/Fe], the part 
of the a-young sample that has the closest to the solar 
[a/Fe] ratio has the longest scale length and the small- 
est scale height. We also noticed that populations with 
[a/Fe] < 0.25 have longer scale lengths and scale heights 
with decreasing [Fc/H]. 

To further investigate these trends, we have fit disk 
models with single exponential profiles in R and Z to 
sub-populations of stars with narrow bins in [Fc/H] and 
[a/Fe]. We divide stars into bins of width 0.1 dex in 
[Fc/H] and 0.05 dex in [a/Fe], and only fit those bins 
with more than 100 stars. The results from these fits 
are shown in FIG. 4. The populations in the lower left 
part of the [a/Fe]- [Fe/H] diagram all have best-fit scale 
lengths in excess of 4.5 kpc. 

We also fit two-component, i.e., two exponential disks 
to each of the bins, but found that these led to overfitting, 



and only marginal improvements in the likelihood for the 
best fit. Thus, for narrow bins in elemental-abundance 
space, the sub-populations are very- well described by sin- 
gle exponential profiles in the R and Z directions. 

A different view of the results in FIG. 4 is given in 
FIG. 5. The results in the different [Fe/H]-[a/Fe] bins 
are shown as a function of scale length and scale height; 
the points are color-coded by their [a/Fe] or [Fc/H] de- 
pendence, and the size of the points corresponds to the 
total stellar surface-mass density — corrected for mass 
and sample selection effects-in each population (calcu- 
lated in Bovy et al. 2012a). FIG. 5 also shows the un- 
certainties in the inferred parameters; the formal uncer- 
tainty in the scale height for some points is so small that 
it cannot be seen. The bins with dashed error bars lie 
in a part of the abundance plane where abundance con- 
tamination is likely to be the most severe, where the 
[a/Fe]-based age ranking is least reliable, and where the 
spatial properties change most rapidly. They contain < 
5 percent of the disk surface mass. 

We see that these fits for mono-abundance sub- 
components flesh out the main trends we noted in the 
broader [Fe/H] and [a/Fe] ranges above. At any given 
metallicity [Fe/H], the scale length increases and the 
scale height decreases when moving from a-old to a- 
young populations. At any given a-age, the scale length 
and the scale height increase for the more metal-poor 
components, implying an outward metallicity gradient. 
And, as FIG. 5 shows most clearly, increasing scale 
lengths are correlated with decreasing scale heights (ex- 
cept for a few bins on the boundary between the very long 
scale lengths at low metallicity and solar a-enhancement 
and the shorter scale lengths of the a-old populations; 
see further discussion in § 5.5). From FIG. 4 it is clear 
that neither [a/Fc] or [Fc/H], on its own, accounts for the 
trends in scale height and scale length. We discuss what 
this implies for disk formation and evolution in §§ 5.4 
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TABLE 2 

Results for the a- young G-dwarf sample ( -0.3 < [Fe/H] < 0.25, 0.00 < [at/Fe] < 0.25) 





h z 


h R 


h Zl 2 


hR,2 


ft 


/3 C 




(PC) 


(kpc) 


(PC) 


(kpc) 






all plates 


270±3 


3.8±0.2 








0.0005±0.0010 


bright plates 


267±3 


3.6±0.2 








0.0009±0.0003 


faint plates 


329±14 


>3.8 (5.1±1.0) 








0.0010±0.0003 


b < 0° 


264±4 


3.6±0.2 








0.0008±0.0009 


b > 0° 


271±4 


3.80±0.10 








0.000±0.001 


161 > 45° 


270±5 


4.2±0.8 








0.0004±0.0008 


[6 < 45° 


264±3 


4.0±0.2 








0.0006±0.0007 


all plates 


256±4 


3.6±0.2 


664±132 


>5 


0.012±0.004 




bright plates 


260±5 


3.5±0.3 


491±83 


>2 


0.02±0.02 




faint plates 


268±23 


>3.8 (5.0±0.8) 


910±152 


>2.9 (6±2) 


0.014±0.008 




b < 0° 


242±8 


3.2±0.2 


639±81 


>5 


0.017±0.010 




b > 0° 


263±6 


3.7±0.2 


834±70 


>4 


0.004±0.002 




161 > 45° 


249±6 


3.8±0.8 


631±142 


>6 


0.015±0.005 




[6 < 45° 


252±5 


3.9±0.3 


656±65 


>5 


0.012±0.005 





-1.5 < [Fe/H] < -0.6 1 689±25 

-0.6 < [Fc/H] < -0.3 1 360±9 

0.00 < [a/Fc] < 0.15 239±4 

0.15 < [a/Fe] < 0.25 348±13 



>37 
>16 

4.3±0.2 
2.3±0.2 



,+704 
-1916 

946±92 



1431" 



l.ll?;® 0.03±0.07 
>14 0.018±0.009 



647±53 
959±335 



>7 

>2.0 (5±2) 



0.010±0.003 
0.018±0.009 



Note. — Lower limits are at 99 percent posterior confidence. Lower limits are given when the best-fit value 
is larger than 4.5 kpc. The best-fit value is not given if the best-fit value is larger than 6 kpc. 

1 These samples have the same [a/Fe] range as the nominal a-young sample. 



and 5.5, respectively. 

FIG. 6 shows the results of fitting two components 
with exponential profiles in both R and Z to each abun- 
dance bin. The scale height of the dominant component 
is shown against the best-fit scale height, when fitting a 
single exponential profile in R and Z. We see that these 
scale heights are strongly clustered around the one-to- 
one correspondence line. Thus, for each bin, a single ver- 
tical exponential suffices to explain the observed number 
counts. The fact that the two measurements agree better 
than would be expected, given the uncertainties shown, 
is due to the fact that the scale heights for each bin are 
strongly correlated when fitting a single or a double ex- 
ponential profile in R and Z . Overall, FIG. 6 confirms 
that a single exponential model in Z and R is a good 
model for the spatial structure of mono-abundance sub- 
populations. 

In Appendix D, we perform a test to determine 
whether abundance uncertainties can plausibly lead us 
to find spurious disk components between a "thin" and 
a "thick" component. That is, we ask whether it is plau- 
sible that an underlying density dominated by distinct 
thin- and thick-disk components can be smoothed by 
abundance errors into the density structure we inferred 
in FIG.s 4 to 6. This test shows that if this were the case, 
every bin is preferentially fit with two components, cor- 
responding to the input thin and thick components. The 
equivalent of FIG. 6, shown in the bottom right panel 
of FIG. 20, is qualitatively different, with a distinct dif- 
ference between the single-component scale height and 
the scale height of the dominant component in the two- 
component fit. 

To test whether the analysis in this section is influ- 
enced by our signal-to- noise ratio cut of SN > 15, we 
have repeated the analysis with a cut of SN > 30, as also 
used by Lee et al. (2011b). The equivalents of FIG.s 4 



to 6 look qualitatively the same, albeit with larger un- 
certainties for each bin, and the dependence of h z and 
ha on elemental abundance is the same as that inferred 
from the sample with the SN > 15 cut. The number of 
([Fc/H], [a/Fe]) bins with more than 100 stars is smaller, 
but the inferred (h z ,hn) for those bins with more than 
100 stars when using SN > 30 cut are consistent within 
the uncertainties with those found with the less restric- 
tive signal-to- noise ratio cut. We stress that even when 
selecting stars with SN > 30, the equivalent of FIG. 6 
does not show any sign of a second component in the 
mono-abundance bins. 

To perform the binning in this section, we used narrow 
bins of 0.1 dcx in [Fc/H] and 0.05 dcx in [a/Fc]. These 
bins are somewhat narrower than the total typical un- 
certainty (« 0.15 dcx in [Fc/H], w 0.07 dex in [a/Fe]; 
Bovy et al. 2012b), but we prefer to oversamplc, rather 
than undcrsamplc, to avoid smoothing out underlying 
structure. The analysis in each bin holds irrespective 
of the bin size. What matters for the analysis is that 
the data in each bin are disjoint, such that the bins are 
statistically independent. 

5. DISCUSSION 

Our basic result is that various stellar disk sub- 
components, when defined purely through stellar abun- 
dances, are simple, i.e., can be described by a single expo- 
nential in R and Z, and exhibit distinctive trends of the 
scale height and scale length with chemical abundance. 
This suggests that dissecting the Milky Way's disk on the 
basis of chemical abundances alone is a useful approach. 
In this section we go through a number of practical issues 
pertaining to these estimates, before discussing possible 
implications for galactic disk formation and evolution. 

5.1. Distance systematics 




1000 



[Fe/H] 




[Fe/H] 



Fig. 4. — Radial scale length (left panel) and vertical scale height 
(right panel) of the best-fitting density models with a single ex- 
ponential in R and Z, shown for different mono-abundance sub- 
populations as a function of their position in the [Fe/H] and [a/Fe] 
plane. Pixels span 0.1 dex in [Fe/H] and 0.05 dex in [a/Fe]. Only 
pixels with more than 100 stars are shown. The striking correla- 
tions between the spatial structure and the elemental abundances 
of the sub-populations is apparent. 



The absolute values of the distance scales measured in 
this paper are subject to distance systematics, which we 
discuss in this subsection. We have used the data-driven 
photometric-distance relation from Ivezic et al. (2008) to 
infer the spatial structure of the various samples of stars, 
but an alternative photometric-distance relation can be 
obtained by using the An et al. (2009) stellar isochroncs 
in the SDSS passbands. These isochrones depend on 
[Fc/H] as well as on [a/Fe], although in practice a lin- 
ear relation between [a/Fe] and [Fe/H] is assumed, and 
the spectroscopically measured [a/Fe] is not used di- 
rectly to estimate the photometric distance. In the top 
panel of FIG. 1 we compared the distance moduli de- 
rived using the An et al. (2009) stellar isochrones with 
those obtained using the Ivezic et al. (2008) relation for 
a few values of [Fe/H]. We see that, for the values of 
[Fc/H] that span most of our sample, the distance modu- 
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Fig. 5. — Correlation between the radial scale length and verti- 
cal scale height of mono-abundance sub-populations, essentially a 
different projection of the results in FIG. 4. The top panel shows 
the vertical scale height versus the radial scale length, color-coded 
by [a/Fe]. The bottom panel shows the same as the top panel, 
but with [Fe/H] color-coding. Bins with best-fit scale lengths in 
excess of 5 kpc are indicated with lower limits in the bottom row. 
The area of the disks are proportional to the total surface-mass 
density contained in each point's [Fe/H]— [a/Fe] bin, corrected for 
mass and sample selection effects (see Bovy et al. 2012a). Some of 
the uncertainties on the vertical scale heights are smaller than the 
points. The bins with dashed errorbars lie in a part of the abun- 
dance plane where abundance contamination is likely to be most 
severe, where the [a/Fe]-based age ranking is least reliable, and 
where the spatial properties change most rapidly. They contain < 
5 percent of the disk surface mass. 



lus difference is -0.12 mag, corresponding to a systematic 
difference in the inferred distances of about 6 percent, 
nearly independent of color. Thus, if we had used the 
An et al. (2009) photometric distances we would have ob- 
tained scale lengths and scale heights that were 6 percent 
shorter. 

A second distance systematic that could influence our 
results is the Malmquist bias (Malmquist 1920, 1922) — 
the fact that brighter stars are over-represented in a 
magnitude-limited survey. For our relatively bright sam- 
ple, this is dominated by the finite width of the pho- 
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Fig. 6. — Scale height of the dominant component, when fitting 
two exponential components in R and Z, versus the scale height 
for a single exponential component in R and Z. The dashed line 
gives the one-to— one correspondence. For most mono-abundance 
populations a single exponential description in Z is a good model. 

tomctric distance relation. The Malmquist bias in ab- 
solute magnitude is apparent-magnitude dependent and 
approximately equal to — <7 2 dlnyl(r)/dr, where A(r) is 
the differential number count as a function of apparent 
magnitude and a is the dispersion in the absolute mag- 
nitudes (either due to photometric uncertainties or due 
to intrinsic scatter in the photometric distance relation) . 
Conservatively assuming that the combination of the fi- 
nite width of the photometric distance relation and the 
photometric uncertainties is 0.2 mag, and that the un- 
derlying density is constant, the Malmquist bias would 
be of order 2.5 percent. However, due to the exponential 
fall-off of the density in both the R and Z directions, 
the differential number counts are (a) flat near the peak 
induced by the vertical exponential, and (b) for most ap- 
parent magnitudes |dlnA(r)/dr| is less than 1. There- 
fore, the Malmquist bias is at most about 2 percent, and 
will not strongly affect the measurement of the vertical 
scale height in particular. 

We have assumed throughout our analysis that all of 
the stars in our sample are single. The presence of un- 
resolved binaries will lead us to underestimate scales, as 
these binaries will appear to us as brighter, and thus 
closer, single stars. The binary fraction and companion- 
mass distribution for G-type dwarfs remains controver- 
sial, but it appears that the overall binary fraction 
for G dwarfs is approximately 40 percent (Abt & Levy 
1976; Duquennoy & Mayor 1991; Raghavan et al. 2010), 
similar to but slightly larger than that of M dwarfs 
(Fischer & Marcy 1992; Raghavan et al. 2010). The dis- 
tribution of companion masses is poorly known, and 
could range from being peaked around 20 percent of the 
primary's mass (Duquennoy & Mayor 1991), to being 
relatively flat between 20 and 100 percent of the pri- 
mary's mass (Raghavan et al. 2010), with numerical sim- 
ulations indicating that multiple-star systems form pref- 
erentially with approximately equal-mass members (Bate 



2005), and an overall multiplicity fraction of around 
40 percent (Bate et al. 2003). Lower- metallicity stars 
most likely have a higher binary fraction (Machida et al. 
2009), and could reach 100 percent for [Fe/H] < -0.8 
(Raghavan et al. 2010). 

For a likely scenario where 40 percent of our a-young 
sample is made up of binary stars (ignoring higher- 
order multiplicities) with a flat distribution of compan- 
ion masses between 20 and 100 percent of the primary's 
mass, the magnitude would be overestimated on aver- 
age by 0.12 mag, such that the scale height and scale 
length would be underestimated by about 6 percent. If 
70 percent of the a-old sample would consist of binary 
systems (taking into account the rising binary fraction 
with decreasing metallicity), the magnitudes would be 
overestimated by approximately 0.21 mag, and the a- 
old scale heights and scale lengths would be underesti- 
mated by 10 percent. These biases are somewhat larger 
than the statistical uncertainties on our results, but they 
are similar to the overall distance-scale uncertainty (see 
above), and they do not change the conclusion that the 
a-old scale length is much shorter than that of the a- 
young sample. Even in a worst-case scenario, where all 
binary systems have equal-mass companions and where 
100 percent of the a-old stars are in binaries, the a-old 
scale-length would still be < 2.8 kpc (40 percent up from 
2 kpc), which is shorter than the scale length measured 
for the a-young sample in TABLE 2 and FIG. 5, and the 
a-young scale lengths themselves would also increase by 
about 15 percent in this scenario. In principle, a careful 
spectral analysis of the SEGUE spectra itself could pro- 
vide direct constraints on the (unresolved) binary con- 
tamination in this sample. 

5.2. Halo contamination 

In our density fits we have mostly fit disk components 
to the data, except for the single exponential disk model 
where we added a uniform density (equation (9)). We 
thus assumed that the stellar halo does not influence 
our disk fits, beyond what can be described by a uni- 
form density across our survey volume. We can esti- 
mate the expected number of halo stars in our sample 
using the Bell et al. (2008) density fits to the smooth 
stellar halo. We run the Bell et al. (2008) stellar-halo 
density through the G-star SEGUE selection function, 
and marginalize over g — r color using a flat distribu- 
tion over 0.48 < g — r < 0.55, and over [Fc/H] using the 
Ivezic et al. (2008) halo metallicity distribution (mean 
[Fe/H] = -1.52, width = 0.32). We then find that for 
w 10 8 G-type stars between 1 and 40 Galactocentric 
kpc in the stellar halo, there should be about 100 halo 
stars in our sample, compared to the total sample size 
of 30,353 G-type dwarfs. Hence, the halo contamina- 
tion is very small and does not influence the fits. Addi- 
tionally, halo contamination will be most severe for the 
a-old sub-populations, and this contamination should 
work to increase the inferred scales (length and height). 
Therefore, the result that the radial scale length of a- 
old sub-populations is shorter than that of a-young sub- 
populations is robust against any halo contamination. 

5.3. Comparison to traditional geometric disk 
decompositions 



The density fits in this paper are the first to con- 
strain the vertical scale height and radial scale length 
of numerous disk sub-components, defined using el- 
emental abundances alone, from a large sample of 
stars. Our results show that the vertically thicker 
disk sub-components — when chemically defined — have a 
much shorter scale length than the thinner-disk sub- 
components, which is opposite to traditional purely geo- 
metric disk decompositions (e.g., Robin et al. 1996; Ojha 
2001; Larsen & Humphreys 2003), which typically find 
that the thick-disk component has a longer scale length 
than the thin disk, and that this scale length is > 3.5 
kpc (e.g., Juric et al. 2008). 

When we fit the spatial structure in our approach, tak- 
ing stars of all mctallicitics (specifically, the combination 
of our a-old ("thick") and a-young ("thin") samples), 
we can recover the result of purely geometric decompo- 
sitions: the thin-disk component — i.e., the component 
with the lowest scale height, w 300 pc — gets paired with 
the shortest scale length (« 2 kpc), while the thicker-disk 
component gets assigned both the largest scale height 
and scale length (for our particular sample fit with a 
combination of three double-exponential disks these are 
« 600 pc and « 2.4 kpc, with a small component with 
an even larger scale height and scale length). Thus, it 
seems that purely geometric decompositions naturally 
associate the longest scale length with the largest scale 
height. That both geometrically determined scale lengths 
are shorter than the scale length of the a-young sample 
is due to the fact that the metallicity distribution for 
the entire sample extends down to [Fc/H] = -1.5, such 
that the model 'expects' many low-metallicity stars in the 
"thin" component at large distances (as the model does 
not contain the information that the thin component has 
higher metallicities), which are not observed. Therefore, 
metallicity and a-element abundances, which are man- 
ifestly quantities that can identify sub-samples of stars 
independent of their spatial structure and kinematics, 
lead to a qualitatively different decomposition into two 
(or more) sub-components than the purely geometrical 
approach, with its inherent risk of circular reasoning. 

5.4. Implications for disk formation 

The distinctive changes of the global disk structure 
with abundance, especially with the age proxy [a/Fe], 
should provide valuable clues to the formation of the 
Milky Way's disk. While a concrete and quantitative 
model comparison is beyond the scope of this paper, 
we discuss some of the qualitative implications here. 
As mentioned in Section 1, the overall radial-density 
profile of the stellar disk is expected to be conserved 
even in the face of large-scale radial migration, but the 
radial profile of sub-components will tend to relax to 
the mass-weighted mean radial profile. Thus, a differ- 
ence in the radial distribution of various populations of 
stars today is a less-pronounced version of more differ- 
ent initial radial distributions (at formation). Assuming 
that the [a/Fe] ratio is an adequate proxy for age (e.g., 
Schonrich & Binney 2009b), our results then imply that 
the a-enhanced, hence oldest, populations are more cen- 
trally concentrated — have a shorter scale length — than 
populations with a-abundances that are closer to Solar, 
and therefore younger. This is direct observational ev- 
idence for inside-out formation of galactic disks across 
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[Fe/H] 

Fig. 7. — Mean orbital radii of the G-dwarf sample. Median 
values of the mean orbital radii are shown, in bins of width 0.05 
dcx in [Fe/H] and 0.025 dex in [a/Fe]. Only bins with at least 20 
stars are shown. 

the presumed age range of our sample, 1-10 Gyr, where 
the inner parts of the disk form before the outer part of 
the disk. A similar age-dependence of the exponential 
scale length has been found in several external galaxies 
(de Jong et al. 2007; Radburn-Smith et al. 2012). 

Second, our analysis shows that our Milky Way has not 
only a metallicity gradient among its youngest stars, but 
that it has always had one (Cheng et al. 2011): at a given 
[a/Fe], standing in for age, sub-populations with lower 
[Fc/H] have a longer scale length than more metal-rich 
stars. This picture is confirmed by looking at the orbital 
properties of the stars when integrating our sample of 
G-type dwarfs in a simple model for the Milky Way's 
potential, made up of a Miyamoto-Nagai disk with a ra- 
dial scale of 4 kpc and vertical scale of 300 pc contribut- 
ing 60 percent of the radial force at the Solar radius, a 
Hernquist bulge with a scale radius of 600 pc contribut- 
ing 5 percent of the radial force, and a Navarro- Frenk- 
White halo with a scale radius of 36 kpc that contributes 
35 percent of the rotational support at the Sun's position. 
The median of the mean orbital radii as a function of ele- 
mental abundance is shown in FIG. 7 (see also Lee et al. 
2011b and Liu & van de Ven, 2011, in prep, for sim- 
ilar figures of the eccentricity and rotational velocity). 
We see that stars with [a/Fc] < 0.25 and lower [Fc/H] 
are thin-disk stars that live, on average, farther out than 
more metal-rich stars. Thus, the longer scale length for 
outer-disk stars, combined with the fact that, for solar 
[a/Fe] decreasing [Fc/H] is correlated with decreasing 
age (e.g., Schonrich & Binney 2009b), implies that the 
outer part of the disk formed later than the inner part. 

We have assumed that [a/Fe] is an adequate proxy 
for age, such that the mono-abundance populations that 
are more [a/Fe]-enhanced are older than the populations 
with solar [a/Fe]. This is typically the case in standard 
scenarios for the star formation history of the Milky Way 
disk, in which [a/Fe] steeply drops around 2 to 3 Gyr 
due to the onset of type la supernovae (Dahlen et al. 
2008; Maoz et al. 2011), and then stays roughly con- 
stant, although the value of [Fe/H] at which the [a/Fe] 
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downturn happens depends on the star formation history 
(Matteucci & Recchi 2001). Only if the local star forma- 
tion was characterized by bursts of star formation can 
younger populations of stars have similar levels of [a/Fe] 
as older stars (Gilmore & Wyse 1991). Most current fits 
of the local star formation history prefer a smooth his- 
tory (e.g., Aumer & Binney 2009), although it is dim- 
cult to rule out epochs of enhanced star formation (e.g., 
Rocha-Pinto et al. 2000). 

5.5. Implications for disk evolution 

The spatial structure inferred for mono-abundance 
sub-populations (FIG.s 4 and 5) show two important re- 
sults: first, there is a tight anti-correlation between the 
scale heights and scale lengths of the sub-components. 
Secondly, there is a continuous distribution in scale 
height when moving from a-enhanced, metal-poor pop- 
ulations to stars with solar [a/Fe] and [Fe/H]. This 
suggests that the a-old — "thick" — and the a-younger, 
"thin", regime of the stellar disk are not two sep- 
arate entities, but merely opposite ends of the disk 
evolution spectrum (suggested before in Norris 1987, 
but never directly measured as we do here). This 
issue, which requires proper stellar-mass weighting of 
the sub-components, is worked out in a separate paper 
(Bovy et al. 2012a). Taken together, these findings sug- 
gest a continuous evolutionary mechanism created the 
observed scale-height distribution, rather than a discrete 
external heating or accretion event. Radial migration is 
an obvious candidate for this internal evolution mecha- 
nism. That the most centrally concentrated component 
of the disk is not only the (a-) oldest part, but also has the 
largest scale height, is a nearly inevitable condition, and 
hence a natural prediction, of any scenario where much of 
the disk scale-height distribution is created through ra- 
dial migration. The a-old sub-population not only had 
the most time to evolve, but its centrally concentrated 
parent population implies that stars at 6 < R < 12 kpc 
have migrated out by the largest factor. 

A different internal explanation for the thicker disk 
components in the Milky Way is that, rather than be- 
ing thickened over the history of the Galactic disk, thick 
components were created thick during an early, turbulent 
phase in the formation of the disk (e.g., Bournaud et al. 
2009; Forster Schreiber et al. 2009). If such a scenario is 
combined with a inside-out growth of the disk, and the 
disk remains turbulent over a significant fraction of its 
history, this formation scenario could plausibly explain 
the continuous dependence of disk structure on elemen- 
tal abundance found in this paper. 

Our result that the transition between the a-young, 
"thin" , components and the a-old, "thick" , components 
is smooth, rather than showing a clear separation be- 
tween thin and thick components, may appear to be 
in conflict with local, high-resolution spectroscopic sam- 
ples of stars (e.g., Reddy et al. 2006; Fuhrmann 2011; 
Navarro et al. 2011) or other analyses of the SEGUE 
data (e.g., Lee et al. 2011b). A detailed comparison be- 
tween these and our results requires careful account- 
ing for the spectroscopic volume sampling, which has 
not been done in the Lee et al. (2011b) analysis or for 
the high-resolution samples, except for the sample of 
Fuhrmann (2011), which is volume complete out to 25 
pc. Without taking the volume selection into account, 



the sample used here also displays a bi-modality in the 
[Fe/H]-[a/Fe] plane (see FIG. 2). We discuss this issue in 
more detail in Bovy et al. (2012a), but we note here that 
the apparent bi-modality in the observed number den- 
sity of stars disappears when properly correcting for the 
spectroscopic sampling. Furthermore, the local, high- 
resolution analyses cannot directly measure the spatial 
distribution of stars of different elemental abundances 
(e.g., Fuhrmann 2011, which only has 15 high- [a/Fe] 
stars out to 25 pc; Reddy et al. 2006) and therefore rely 
on kinematics to argue that the vertical distribution of 
stars in the Solar neighborhood is characterized by a bi- 
modal "thin" -"thick" -disk dichotomy. This interpreta- 
tion is driven by the selection of stars that are disjoint 
in [a/Fe], which leads to disjoint kinematics because the 
kinematics is a strong — and smooth — function of abun- 
dance as well (Bovy et al. 2012b). While the stellar con- 
tent of different survey volumes can (and should) be con- 
nected by dynamics, we note that the effective volumes 
sampled by, e.g., the Fuhrmann (2011) survey and by our 
analysis differ by a factor of about 10 5 ; hence the extrap- 
olation from one to the other is enormous. The analysis 
of the vertical kinematics of stars in our sample confirms 
the existence of the intermediate populations with scale 
heights between 400 and 600 pc and vertical- velocity dis- 
persions of 30 to 35 km s _1 (Bovy et al. 2012b). 

Our finding that the scale length does not behave as 
smoothly as the scale height, as a function of [a/Fe], 
is presumably a consequence of the disk's formation his- 
tory: here the increasing metallicity as a function of time 
(i.e., youth) and the radial metallicity gradient compete. 
As the mapping between [a/Fc] and age is not linear, 
but rather, [a/Fe] steeply drops around 2 to 3 Gyr due 
to the onset of type la supernovae (Matteucci & Recchi 
2001; Dahlen et al. 2008; Maoz et al. 2011), and then 
stays roughly constant, the scale length should change 
similarly rapidly with [a/Fe]. The scale height, however, 
is determined by subsequent evolution, where radial mi- 
gration transports stars to larger Galactocentric radii, 
where the lower disk density allows them to travel far- 
ther from the plane. Since this evolution is continuous, 
rather than sudden, and includes additional contribu- 
tions from heating, trends in scale height versus elemen- 
tal abundance should be expected to be smoother, even if 
radial migration is not the disk's dominant evolutionary 
mode. Our results are therefore consistent with a sce- 
nario where the thick-disk component is the inner part 
of the disk that formed at the earliest time, and either 
by having formed thick or through the effect of radial 
migration, has a large scale height at the present time. 

A gas-rich merger, followed by intense star formation 
at an early time, could have affected the formation of the 
early disk (Brook et al. 2004), as seems consistent with 
the observed distribution of eccentricities of the thick- 
disk component (Sales et al. 2009; Dierickx et al. 2010; 
Wilson et al. 2011). However, it would lead to a scale 
length for the thicker component that is larger than that 
of the thinner component (Qu et al. 2011). It is clear 
that internal mechanisms must have played an impor- 
tant role during the evolution of the disk. However, we 
caution that the radial and vertical consequences of nei- 
ther radial migration, nor turbulent disk evolution, nor 
of satellite thickening, have been worked out in quantita- 
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tive detail, and, in particular, resonant coupling between 
satellites and the disk might induce some similar obser- 
vational signatures to radial migration. 

The rapid change in the mean stellar population in an 
[a/Fc]- [Fe/H] abundance bin at the onset of type la su- 
pernovae is also likely the explanation for the presence 
of the few points of intermediate [cc/Fe] and [Fe/H] in 
FIG. 5 that do not follow the anti-correlation between 
scale height and scale length; these bins, which do not 
contribute significantly to the total stellar mass (indi- 
cated by the size of the symbols in FIG. 5), are also the 
bins that fall short of the one-to-one correlation between 
single- and two-disk fits in FIG. 6. This provides fur- 
ther evidence of the fact that at the rapid [a/Fe] (age) 
transition our bins do not adequately resolve single com- 
ponents. 

6. CONCLUSIONS 

The main conclusions of this paper are as follows 

• An assessment of the global (R, Z) structure of the 
Milky Way's stellar disk for sub-components selected 
solely by their elemental abundances is now feasible, e.g., 
with spectroscopic surveys such as SEGUE, but requires 
a thorough accounting for the effective selection function 
of the spectroscopic sample. 

• A decomposition of the Galactic disk, based on 
SDSS I SEGUE data for G-type dwarfs, into mono- 
abundance sub-populations in the [Fe/H]-[a/Fe] plane, 
reveals that each such component has a simple spatial 
structure that can be described by single exponential pro- 
files in both the vertical and the radial direction. 

• Adopting increasing levels of [a/Fe] enhancement as 
a proxy for the increasing age of the stellar population, 
the disk dissection into narrow mono-abundance popula- 
tions in the space of [Fe/H] and [a/Fc] exhibits a contin- 
uous trend of increasing scale height and decreasing scale 
length, when moving from younger to older populations 
of stars. 

• We find that the oldest — most a-enhanced — part of 
the disk is both the thickest and the most centrally con- 
centrated. If we split the sample in only two broad abun- 
dance regimes we can make a precise determination of 
the a-old scale length, 2.01±0.05 kpc, and scale height, 
686±11 pc. The scale length of the a-younger disk is 
around 3.5 kpc (3.6±0.2 kpc for our nominal a-young 
sample) and is far thinner, with a vertical scale height of 
256±4 pc. 

• These observations show quite directly that the bulk 
of the Galactic disk has formed from the inside out. 

• The tight (anti-) correlations between population 
age, vertical scale height, and radial scale length strongly 



suggest that the disk's subsequent evolution must have 
been heavily influenced by internal mechanisms, such 
as radial migration or turbulent, gravitationally-unstable 
disk evolution, as this naturally explains the continuous 
increase of scale height with decreasing scale length. At 
first sight, external mechanisms to form the Milky Way's 
thick disk component through external heating or accre- 
tion appear to be inconsistent with our results, but a 
thorough model comparison is warranted. 

While, at face value, our results emphasize the im- 
portance of evolutionary processes that could be purely 
internal to the Milky Way (radial migration, turbulent 
disk formation), the overall ACDM cosmogony makes it 
likely that external processes must also have played some 
role. In the end, it is likely that the Milky Way disk's 
formation history may be more complex than inferred 
here, especially once not only the spatial distribution but 
also the orbital distribution of the mono-abundance sub- 
populations is fully analyzed. 

It is a pleasure to thank the anonymous referee, James 
Binney, Doug Finkbeiner, Dan Foreman-Mackey, Patrick 
Hall, Juna Kollmeier, George Lake, Rok Roskar, Scott 
Tremaine, Glenn van de Ven, and Lan Zhang for help- 
ful comments and assistance. We thank the SEGUE 
team for their efforts in producing the SEGUE data set, 
and Connie Rockosi and Katie Schlesinger in particu- 
lar for help with the SEGUE selection function. Sup- 
port for Program number HST-HF-51285.01-A was pro- 
vided by NASA through a Hubble Fellowship grant from 
the Space Telescope Science Institute, which is oper- 
ated by the Association of Universities for Research in 
Astronomy, Incorporated, under NASA contract NAS5- 
26555. J.B. and D.W.H. were partially supported by 
NASA (grant NNX08AJ48G) and the NSF (grant AST- 
0908357). D.W.H. is a research fellow of the Alexander 
von Humboldt Foundation of Germany. J.B. and HW.R 
acknowledge partial support from SFB 881 funded by the 
German Research Foundation DFG. Y.S.L. and T.C.B. 
acknowledge partial funding of this work from grants 
PHY 02-16783 and PHY 08-22648: Physics Frontier Cen- 
ter / Joint Institute for Nuclear Physics (JINA), awarded 
by the National Science Foundation. 

Funding for the SDSS and SDSS-II has been pro- 
vided by the Alfred P. Sloan Foundation, the Partic- 
ipating Institutions, the National Science Foundation, 
the U.S. Department of Energy, the National Aeronau- 
tics and Space Administration, the Japanese Monbuka- 
gakusho, the Max Planck Society, and the Higher Educa- 
tion Funding Council for England. The SDSS Web Site 
is http://www.sdss.org/. 



APPENDIX 

THE SEGUE G-STAR SELECTION FUNCTION 

To determine the spatial distribution of the G dwarfs, we require a good understanding of the SEGUE G-star 
selection function, i.e., the fraction of stars that has been targeted by SEGUE and produced good enough spectra to 
derive the parameters needed in the present (or any other) analysis (e.g., SN > 15), and we need this selection fraction 
as a function of position, color, and apparent magnitude. The observed density of G-type stars is simply the product 
of the underlying density with the sampling selection function, suggesting that one constrains this underlying density 
by forward modeling of the observations. 

The spectroscopic G-star target type was selected uniformly from the set of objects in the G-star color-magnitude 
box in the area and apparent magnitude range of the spectroscopic plug-plates (simply "plates" hereafter), thus the 
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Fig. 8. — Distribution of the photometric sample of G-type stars (linear density grayscale; black curves) and the spectroscopic sample 
(white contours, dashed histograms) after the signal-to-noise ratio cut of SN > 15. The contours contain 68, 95, and 99 percent of the 
distribution. 

selection function can be reconstructed. The SEGUE survey implementation distinguishes between "bright" and 
"faint" plates, with bright plates containing stars with r < 17.8 mag and faint plates containing stars with r > 17.8 
mag. For the purposes of the selection function, we assume that this separation at 17.8 mag is a hard cut, even though 
in reality some stars were observed on both bright and faint plates for calibration purposes, and some "bright" stars 
are part of faint plates, and vice versa, because of changes between the photometry used for target selection and that 
released as part of the SDSS DR7, which we employ here. Duplicates are resolved in favor of the higher signal-to-noise 
ratio observation (typically on the faint plate as this has a longer integration time). We retain stars with r > 17.8 
mag when they were observed on a bright plate, and we keep objects with r < 17.8 mag when they were observed on 
a faint plate, even though this should not happen in our model for the SEGUE selection function below. A total of 
586 stars in the a-old sample and 47 stars in the ct-young sample fall into this category; they do not influence any of 
the fits or conclusions in this paper. 

We select the superset of targets by querying the SDSS DR7 imaging CAS 9 for all potential targets in the color- 
magnitude box of the G-star target type in the area of a SEGUE plate (Yanny et al. 2009). These objects arc 
primary 10 detections (removing duplicates and objects from overlapping imaging scans) with stellar PSFs (type equal 
to 6). Objects must not be saturated, nor be close to the edge, nor have an interpolated PSF (interp_psf ), and must 
not have an inconsistent flux count (badcounts). Furthermore, if the center is interpolated (interp_center), there 
should not be a cosmic ray indicated (cr). See Stoughton et al. (2002) for a description of the SDSS photometric flags. 
Using the superset of targets we determine for each plate the fraction of stars that were observed spectroscopically of 
all available targets. 

To infer the dependence on color and apparent magnitude of the selection function, we look at the distribution of 
the potential G-star targets in color-magnitude space. This is shown in FIG. 8. The distribution of the spectroscopic 
sample is overlaid. This shows that the spectroscopic sampling is relatively fair in g — r color, with some frayed edges 
because of changes between target and current photometry, and that the selection as a function of r-band magnitude 
tapers at the faint end, as should be expected when using a signal-to-noise ratio cut. If all SEGUE plates were 
integrated to the same depth, the signal-to-noise ratio cut should be a clean cut in r, but it is clear from FIG. 8 that 
this is not the case. To distinguish between relatively shallow and relatively deep plates, we introduce the overall plate 
signal-to-noisc ratio plateSNjr 

platcSN_r = (snl_l + sn2_l)/2 , (Al) 

where snl_l and sn2_l are the r-band plate signal-to-noise ratio for the two SDSS spectrographs (see TABLE 17 in 
Stoughton et al. 2002). The faintest spectroscopic G-type star per plate as a function of plateSNjr for the faint plates 
is shown in FIG. 9 for the faint plates. This figure shows that there is a clear difference in the faintest object that 
could have been successfully observed at SN > 15 between relatively shallow and relatively deep plates. The bottom 

9 http://cas.sdss.org/dr7/en/ . an( j 

10 Sec http://sdss3.org/dr8/algorithms/bitmask_flagsl.php http://sdss3.org/dr8/algorithms/bitmask_flags2.php for a dc- 

scription of these flags. 
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Fig. 9. — Maximum apparent r-band magnitude per plate versus overall plate signal-to-noise ratio plateSN_r for the G-star sample with 
SN > 15 for faint plates {top panel). Signal-to-noise ratio SN for stars on four typical SEGUE plates as a function of the apparent r-band 
magnitude {bottom panel). The four plates have been chosen to show a range in overall plate signal-to-noise ratio plateSN_r. 

panel of FIG. 9 shows the signal-to-noise ratio of stars on four plates chosen to cover a range in the overall plate 
signal-to-noisc ratio. This shows that the SN > 15 cut for the entire sample translates into a fairly sharp r-band cut 
for each individual plate. 

Our model for the SEGUE G-star selection function is then the following: For each plate we find the faintest targeted 
object in r-band magnitude with SN larger than our signal-to-noise-ratio cut, with apparent magnitude r cut (if this 
object is fainter than the nominal limit r max for bright or faint plates, we set r cut equal to this limit; r max = 17.8 mag 
for bright plates and 20.2 mag for faint plates), and then assume that the selection function for that plate is given by 
a hyperbolic tangent cut-off, centered on r cut — 0.1 mag, and with a width-parameter whose natural logarithm is -3 
(« 0.05 mag), such that the total width of the cut-off is about 0.2 mag and the faintest object on the plate is about 
2 widths from the center of the cut-off. The function value at the bright end is equal to the number of spectroscopic 
objects brighter than r cut divided by the total number of targets brighter than r cut . Thus, the plate-dependent selection 
function is given by 



S^platc,^ g — r) = 

^spectroscopic objects with r„ 



< r < r r 



#targets with r min < r < r cut 



1 — tanh 



0.1 



exp (—3) 



(A2) 



where the numbers of objects are evaluated within the ~ 7 deg 2 area of the plate in question and in the 0.48 < g — r < 
0.55 G-star color range; r min is 14.5 mag for bright plates and 17.8 mag for faint plates. The selection function is 
zero outside of the apparent r-band magnitude range of the plate ([14.5, 17.8] for bright plates and [17.8, 20.2] for faint 
plates). 

We use this model both for the bright plates and the faint plates, although most bright plates are in fact consistent 
with being complete up to 17.8 mag. FIG. 10 shows the distribution of Kolmogorov-Smirnov (KS) probabilities that 
the spectroscopic sample for any given plate was selected from the target sample with this model for the selection 
function. All but 7 plates have probabilities larger than 0.001 and the distribution of probabilities is relatively flat, as 
expected. 

Rather than using a smooth hyperbolic tangent cut-off, we also tried a sharp cut at r cut . With this model for the 
selection function, 79 plates have a KS probability < 0.05 (w25 percent of the number of plates), as opposed to 30 
plates in the hyperbolic-tangent-cut-off model («9 percent of the sample). Therefore, the smooth cut-off is necessary 
to fully model the selection function. The fact that the distribution of KS probabilities in FIG. 10 is not entirely flat 
is due to remaining details in the faint cut-off of the selection function, as we know that the selection function is flat 
at brighter magnitudes. This does not impact our analsis greatly, as most stars are much brighter than the cut-off (as 
compared to the scale over which the selection function changes near the cut-off). 

The selection function is simplest in its native coordinates, survey plate, and r-band magnitude. For each value 
of g — r and [Fc/H], the r-dependent selection function above translates into a (different) spatial selection function 
through the use of the photometric distance relation. The selection function projected into spatial coordinates for a 
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Fig. 10. — Distribution of the probability that a plate's spectroscopic sample was drawn from the photometric sample combined with the 
model selection function. The leftmost bin contains the plates with probability < 0.001. For the correct model of the SEGUE selection 
function, the distribution should be approximately flat between zero and one. 
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Fig. 11. — The SEGUE selection function — the fraction of objects successfully observed spectroscopically with SN > 15 — for the G-star 
sample, as a function of Galactic coordinates X and Y {left panel), and of Galactocentric radius R and vertical height Z {right panel). 
The r-dependent SEGUE selection function is here transformed into spatial coordinates using the photometric distance relation applied to 
a color g — r = 0.515 mag and [Fe/H] = -0.5. 




typical value of g — r and [Fe/H] is shown in FIG. 11. Near \b\ = 90° the spectroscopic sample is relatively complete, 
whereas near the Galactic plane the selection is much less complete. 
We have posted Python code that implements this model for the SEGUE selection function. It is publicly available 

at 

https : / /github . com/ j obovy/ segueSelect . 
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THE MAGNITUDE-COLOR-METALLICITY DENSITY AND ESTIMATES OF THE EFFECTIVE 

SURVEY VOLUME 

The density in magnitude-color-metallicity space needs to be included in the likelihood in equation (6), because 
it forms the basis of the photometric distance relation used to translate observed colors, metallicities, and apparent 
magnitudes into distances, which ultimately relate to the effective search volume. We assume here for simplicity that 
stars of a given g — r and [Fc/H] follow a single stellar isochrone given by the Ivezic et al. (2008) photometric distance 
relation in terms of g — r using equation (1) to translate g — r into the g — i color used by the Ivezic et al. (2008) 
relation. The reason for expressing the Ivezic et al. (2008) g — i-metallicity-magnitudc relation into g — r is to keep 
the integration in equation (7) simple; if we had chosen to use the g — i relation we would have to include the r — i 
color as well, and model and integrate over the full g — r,r — i plane. As the stellar locus is very narrow (< 0.1 mag), 
this adds less (random) scatter than is intrinsic to the photometric distance relation. 

In the single-isochrone model, p(r, g — r, [Fe/H]|_R, Z 7 </>) becomes the product of a delta function with the density in 
the color-metallicity plane 



p(r, g-r, [Fe/H]|i?, Z, 0) = 5(r - r[g - r, [Fc/H] , d] \R, Z, 0) p(g - r, [Fc/H] \R, Z) , (Bl) 



where r[g — r, [Fe/H], d] is the apparent magnitude derived from the photometric distance relation combined with the 
distance, and by a slight abuse of notation we have used the same symbol to denote the density in the color-metallicity 
plane. We assume that this density is independent of Galactocentric azimuth <fi, but for now allow it to depend on R 
and Z. Using this, the normalization integral in equation (7) simplifies to 



J dl db dd dr d(g - r) d[Fc/H] X(l, 6, d,r,g- r, [Fc/H] \6) 
= A p J2 J d(.9-r)d[Fe/H] 

plates p 
/•d[r max ,ff-r,[Fe/H]] 

x / ddS( P , r[g - r, [Fe/H] ,d],g - r) p{g - r, [Fe/R]\R, Z) d 2 V ,{R, z\l, b, d, 9) , 

•/d[r mte ,g-r,[Fe/H]] 



where r m j n and r max are the minimum and maximum apparent magnitude of plate p, and the functions d[-] and r[-] 
use the photometric distance relation. 

The color-metallicity distribution for the a-young and en-old sample is shown in FIG.s 12 and 13, respectively. 
The top-left panel shows the distribution for the entire sample; the remaining panels show the color-metallicity 
distribution as a function of Galactocentric radius (including all vertical heights) and as a function of vertical 
height (including all Galactocentric radii). For both samples, the color-metallicity distribution separates into 
the product of one-dimensional color and mctallicity distributions, thus wc assume that p(g — r, [Fe/H]|i?, Z) = 
p c (g — r\R, Z) p[ Fe / H ] ([Fc/H] \R, Z). The g — r distribution is independent of R and Z for both the a-young and 
the a-old sample; we use a spline interpolation of the color distribution for the full sample for p c (g — r\R,Z), in- 
dependent of R and Z. This interpolation is shown in the top histogram in all panels of FIG.s 12 and 13. The 
metallicity distribution of the a-old sample is also mostly independent of R and Z, with only a hint of a trend toward 
a more metal-poor distribution at large distances from the plane. The [Fe/H] distribution of the a-young sample 
shows expected trends with R and Z: The peak of the metallicity distribution goes from more metal-rich closer to 
the Galactic center and closer to the plane, to more metal-poor at larger Galactocentric radii and at larger Z. These 
shifts are modest (< 0.1 dex), which is partly due to the fact that farther from the Solar radius we preferentially see 
stars at larger distances from the plane. We stress that these metallicity distributions are the observed distributions 
uncorrected for selection effects, but selection effects play a minor role and merely shift the overall distribution by 
0.1 dex (Schlcsingcr ct al., 2011, in preparation). We investigate the effect of systematically shifting the metallicity 
distribution below. 

The effect of mctallicity and color on the absolute magnitude using the Ivezic et al. (2008) color-metallicity- 
magnitude relation is shown in the bottom right panel of FIG.s 12 and 13, for the ranges in color and mctallicity 
considered for both samples. From the blue and metal-rich to the red and metal-poor end the shift in absolute 
magnitude is about 1 mag, or a factor of about 1.6 in distance. 

As the a-old metallicity distribution depends only weakly on R and Z, we will assume that it is constant, and use 
a spline interpolation of the [Fe/H] distribution of the full sample as our model for p[ Fc / H l([Fc/H]|i?, Z). We do the 
same for the a-young sample, even though there are slight trends with R and Z . These models arc shown in the right 
histograms of all panels in FIG.s 12 and 13. We can then simplify the normalization integral in equation (B2) further 



18 





206 stars 
5 < K/kpc < 7 





191 stars 
6 < R/kpc < 7 




1088 stars 
7 < R/kpc < 8 



0.50 0.52 ( 

g-r [mag] 






2535 stars 
8 < fl/kpc < 9 





1130 stars 
9 < fl/kpc < 10 





270 stars 
10 < R/kpc < 11 




132 stars 
11 < fl/kpc < 12 



g-r [mag; 




2065 stars 
250 < |Z|/pc < 750 







690 stars 
1250 < |Z|/pc < 1750 





168 stars 
1750 < |Z|/pc < 2250 





g-r [mag] 




63 stars 
2250 < |Z|/pc < 2750 





19 stars 
2750 < |Z|/pc < 3250 




18 stars 
3250 < |Z|/pc < 4000 




g-r [mag] 





Fig. 12. — Distribution of [Fe/H] and g — r for the full o-young G-dwarf sample (top, left panel) and split into ranges in Galactocentric 
radius R and vertical height \Z\ (other panels). Linear binned densities with contours containing 68, 95, and 99 percent of the distribution 
and individual outliers beyond 99 percent are shown in spatial bins with more than 1,500 stars. A smooth interpolation of the one- 
dimensional distributions in the top left panel is shown in all panels. The bottom right panel shows the absolute r-band magnitude as a 
function of g — r and [Fe/H] from the Ivezic et al. (2008) color-metallicity-magnitudc relation (their eqn. A7). 



to 



/ 



dl db dr dd dr d(g - r) d[Fe/H] A(Z, 6, d, r, g - r, [Fe/H] | (5) 
= A P Jd(9-r)p c (g-r\R,Z) J d[Fe/R]p^([Fe/R]\R,Z) {m) 

plates p 

x / ddS(p,r[g-r, [Fe/E], d], g ~ r) d 2 v*(R, z\l,b, d,6) . 

•Mr mi „,g-r,[Fc/H]] 

If wc then determine the overall minimum and maximum heliocentric distance at which we can observe stars in both 
samples, we can calculate the inner integral between these limits, with the understanding that the selection function 
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Fig. 13. — Same as FIG. 12, but for the ct-old G-dwarf sample. 
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is zero outside of the apparent-magnitude range of the plate in question (since bluer or more metal-rich stars can only 
be observed at distances starting at a value that is larger than the overall minimum distance, and redder and more 
metal-poor stars can only be seen out to distances that fall short of the overall maximum distance, because of the color 
and metallicity dependence of the photometric distance method). We can then calculate the integral by summation 
on a regular grid as 



J dl db dr ddd(g - r) d[Fe/H] A(7, b, d, r, g - r, [Fc/H] \6) 
= A p ^2 ^2d 2 is*(R,z\l,b,d,0) 

plates p d 

E E P C (.9-0P [Fc/H1 ([Fc/H])^,r[ 3 -r,[Fc/H],d],.g-r), 

g-r [Fe/H] 



(B4) 



where the distance summation is between the overall minimum and maximum distance. We dropped integration 
factors Ad, A(g — r), and A [Fe/H], as these only contribute terms that do not depend on the parameters 8 in the 
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log likelihood in equation (8) (note that they do contribute when we do not marginalize over the amplitude of the 
density in equation [6]). Written in this way, this normalization integral can be computed efficiently, as all of the 
necessary coordinate transformations, selection function evaluations, and color-mctallicity-distribution function calls 
can be pre-computed on a dense grid. 

DETAILED DATA VERSUS MODEL COMPARISONS 

In this Appendix we present detailed comparisons of our best-fit density models with the observed data, as ulti- 
mately the best-fit density parameters are constrained through the quality of the fit in the natural coordinates of 
the spectroscopic data (l,b,r,g — r, [Fc/H]). We also show that the results we obtain for different sub-samples of our 
nominal samples are consistent with the best fits for the full samples. As we fit density models by forward modeling 
the underlying density model, i.e., by taking the spatial density and running it through the SEGUE selection function 
and the photometric distance relation, we cannot show direct maps of the density in any meaningful way without 
massaging the data excessively. Therefore, we compare the observed star counts with the best-fit model by running 
the underlying star counts model through the selection function and photometric magnitude-color-mctallicity relation, 
and then comparing it with the observed star counts. This has the added advantage that it shows that the entire 
framework of (a) the underlying density, (b) the photometric magnitude-color-metallicity relation, and (c) our model 
of the SEGUE selection function provides a valid description of the observed data. 

The a-old disk stars 

FIG. 14 compares the observed distribution of vertical heights \Z\ of the a-old G-dwarf sample to that predicted 
by the best-fit model. This prediction is obtained by running the best-fit density model integrated over the color- 
metallicity distribution through our model for the SEGUE selection function. There are 35 stars with magnitudes 
that should put them on faint plates, but that were observed on bright plates, and 551 stars in the opposite situation 
are cut from the data sample to show this comparison. The comparison between the data and the model is shown for 
all plates and for the bright and faint plates separately The agreement between the model and observed distribution 
is excellent for all of these. FIG. 15 shows a similar comparison for the distribution of Galactocentric radii of the data 
and in the model. The model correctly predicts the observed star counts for most Galactocentric radii, except the 
smallest around 5 kpc, where the model slightly overpredicts the number of stars (here, and in further comparisons 
below, the model around 8 kpc behaves somewhat erratically, as this is the boundary between 90° < I < 270° and 
—90° < I < 90° plates, and we do not use the finite extent of the plate in our model distributions). Also shown in this 
figure and in FIG. 14 are models that only differ from the best-fit model in their radial scale length: a model with a 
scale length of 3 kpc and one with a scale length of 4 kpc. It is clear that these longer scale lengths are strongly ruled 
out by the model, as they strongly overpredict the star counts at large Galactocentric radii. 

TABLE 1 lists the best-fit parameters for fits that only use (a) bright or faint plates, (b) b > 0° or b < 0° plates, or 
(c) |6| > 45° or \b\ < 45° plates. The results from all of these different samples are roughly consistent with each other; 
we note that we can even measure the radial scale length with high-latitude plates (\b\ > 45°) alone. 

FIG.s 14 and 15 show comparisons between the observed star counts and the model, when we split the a-old sample 
into more metal-poor and more metal-rich sub-samples by cutting the sample at [Fe/H] = —0.7. Comparisons for 
when we split the a-old sample into two bins in [a/Fe], by cutting the sample at [a/Fe] = 0.35, are shown in FIG. 16. 

The a-young disk sample 

FIG. 17 compares the best-fit model to the observed star counts as a function of vertical height and FIG. 18 shows 
this comparison as a function of Galactocentric radius, again removing 4 stars with magnitudes that should put them 
on faint plates but that were observed on bright plates and removing 43 stars in the opposite situation. We also show 
models whose parameters are the same as those of the best-fit model, but with shorter scale lengths of 2 and 3 kpc. 
The faint plates, which only contain 6 percent of the a-young sample, rule out a short scale length of 2 kpc for the 
a-young disk. The best-fit model provides a good fit to the observed star counts. 

We again also list the best-fit parameters for fits that only use (a) bright or faint plates, (b) b > 0° or b < 0° 
plates, or (c) \b\ > 45° or \b\ < 45° plates in TABLE 2. The results from all of these different samples arc again 
roughly consistent, except for the faint plates fit, which prefer even longer radial scale lengths, but faint plates only 
contain 6 percent of the a-young sample. We have also run fits for the a-young sample where we (a) employ a more 
conservative SN cut of SN > 30, (b) enlarge our sample with a less conservative SN cut of SN > 10, (c) remove stars 
on plates whose K-S probability for the spectroscopic sample to have been drawn from the underlying photometric 
sample combined with our model for the SEGUE selection function (see FIG. 10) is smaller than 0.1, (d) use stars 
from the SEGUE database that were explicitly targeted as G-type stars (with all other log g, SN, E(B — V) cuts), 
(e) remove stars with magnitudes that should put them on SEGUE bright plates, but that were observed as part of a 
faint plate and vice versa, and (f) artificially shift the metallicity distribution 0.1 dcx toward the more metal-rich end. 
The results from these fits arc all consistent with those obtained for our nominal sample with fiducial cuts. 

FIG.s 17 and 18 show comparisons between the observed and predicted star counts for the a-young samples that 
are more metal-poor than the nominal a-young sample. The fit for the —0.6 < [Fc/H] < —0.3 sample is good, while 
the fit for the most metal-poor a-young sub-sample is not entirely satisfactory. Comparisons between the observed 
star counts and the model when we split the a-young sample into two, by cutting at [a/Fc] = 0.15, are shown in the 
lower two rows of FIG. 16. 
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Fig. 14. — Comparison between the observed distribution of vertical heights of the ct-old G-dwarf sample and the distribution predicted 
by the best-fit mixture of two double-exponential disks. The dashed and dotted lines show the same model, but with a scale length of 3 
and 4 kpc, respectively. The bottom two rows show the nominal sample used in the top row split at [Fe/H] = —0.7. See TABLE 1 for the 
parameters of the best-fit models. 



ANALYSIS TEST ON MOCK DATA SAMPLES 

In order to test the methodology for fitting the density discussed in § 3, and as check on the code, we create mock 
data samples selected in exactly the same way as the SEGUE G-dwarf sample and fit them using our algorithm. We 
also use this framework to test whether the results we obtain can plausibly be the result of abundance errors smoothing 
out an underlying two-component thin-thick disk structure. 

We create mock data sampled from a model underlying density by calculating, for each line of sight, (i) the fraction 
of stars in the sample that lies along that line of sight and (ii) the distribution in r-band magnitude as a function of 
color g — r and mctallicity [Fc/H]. For calculating both of these, we take the SEGUE selection function, described in 
Appendix A, into account. Thus, we can obtain a sample that is equivalent to what SEGUE would have observed for 
a particular density model. 

To test the methodology and code we populate each mono-abundance bin in the ([Fe/H],[o;/Fc]) plane with a sample 
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Fig. 15. — Same as FIG. 14, but for the distribution of Galactocentric radii. The dashed and dotted lines show the same model but with 
a scale length of 3 and 4 kpc, respectively. 



drawn from a thin-disk component with h z = 300 pc and Iir = 3.5 kpc, keeping the abundances and number of stars 
in each bin the same as in the observed sample. We then run the same analysis code on this sample as is run to 
produce the real data results in FIG.s 4 to FIG.s 6. We find results that are consistent with the input model within 
the uncertainties for each bin. The uncertainties are similar to those found for the real data near h z = 300 pc and 
hji = 3.5 kpc. We repeat this for an input "thick" disk model with h z = 850 pc and ■ = 2 kpc, and again find results 
that are consistent with the input model within the uncertainties. 

We use a similar procedure to investigate whether abundance errors can smooth out an underlying disk model made 
up of a thin- and thick-disk component without showing up in our analysis. Assuming SEGUE abundance uncertainties 
of 0.2 dex in [Fe/H] and 0.15 dex in [a/Fc], we first model the underlying abundance distribution using two Gaussian 
components, and fit this model to the observed distribution with the assumed abundance uncertainties using the 
extreme-deconvolution technique (Bovy et al. 2011). We use a Gaussian mixture model for the underlying distribution 
solely as a convenient of decomposing the observed distribution for the purpose of this test. The two-Gaussian mixture 
model adequately represents the observed distribution after convolving again with the uncertainties. The best-fit 



23 




\Z\ [kpc] 



Fig. 16. — Structural parameters of the G-dwarf sample as a function of [ct/Fe]. The top two rows show the ct-old sample split at [o/Fc] 
= 0.35; the bottom two rows contain the ct-young sample split at [ct/Fe] = 0.15. The left column compares the observed distribution of 
vertical heights to the model distribution. The right column does the same for the distribution of Galactocentric radii. Each sample is 
fit with a mixture of two double-exponential disks. The best-fit parameters of the dominant disk component are shown in the top-left of 
each panel. In each case the secondary component only contributes a few percent of the mass at the solar circle (see TABLEs 1 and 2 for 
detailed results). 
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Fig. 17. — Comparison between the observed distribution of vertical heights of the ce-young G-dwarf sample and the distribution predicted 
by the best-fit mixture of two double-exponential disks model. The dashed and dotted lines show the same model, but with a scale length 
of 2 and 3 kpc, respectively. The bottom two rows show samples with the same [a/Fe] range of the nominal a- young sample, but with 
different [Fe/H] boundaries. See TABLE 2 for the parameters of the best-fit models. 



mixture has a Gaussian centered near solar abundances (40 percent of the sample; [Fc/H] = —0.3 dex, [a/Fe] = 0.1 
dex) and one at metal-poor and a-enhanced abundances (60 percent of the sample; [Fe/H] = —0.7 dex, [a/Fe] = 0.35 
dex). To reproduce the observed distribution, these components both need a dispersion of 0.2 dex in [Fe/H] and 0.07 
dex in [a/Fe], with a correlation of —0.85 and —0.6, respectively. We then assign stars to these two components 
with probabilities computed from their posterior probability of being drawn from either component, based on their 
abundances and assumed abundance uncertainties. We sample new r-band magnitudes and coordinates for these stars 
based on the component they are assigned to: we draw the stars assigned to the solar- abundances component from 
a thin-disk density with h z = 300 pc and Hr = 3.5 kpc, and stars assigned to the a-enhanced component from a 
thick-disk distribution with h z = 850 pc and hu = 2.5 kpc. We then run the same analysis code on this sample as is 
run on the real data. 

The results from this test are shown in FIG. 20. Although in certain respects they are similar to the results for the 
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Fig. 18. — Same as FIG. 17, but for the distribution of Galactocentric radii. The dashed and dotted lines show the same model, but with 
a scale length of 2 and 3 kpc, respectively. 



real data, they arc different in a few crucial ways. Most importantly, when fitting a mixture of two exponential models 
to each bin we find unambiguous evidence in many bins for two components. This is shown in the lower right panel 
of FIG. 20, where the scale height of the dominant component when fitting the mixture is shown vs. the scale height 
when fitting a single exponential. For most bins with single-exponential h z < 800 pc, the dominant component is the 
h z = 300 pc input thin-disk component. Therefore, even though the abundance pattern of the single-exponential scale 
height in the lower left panel of FIG. 20 is smooth between thin and thick components, most bins are actually resolved 
into the two input components. This is a major difference with the real data, for which the equivalent comparison, 
shown in FIG. 6, shows a striking one-to-one correlation between the single-exponential and the mixture scale height, 
with no evidence for a second component for the vast majority of the mono-abundance bins. 

In addition to the fact that our analysis correctly identifies two components in each mono-abundance bin in the 
mock data, the abundance dependence of the inferred single-exponential scale height and scale length is also quite 
different from that of the mock data. The inferred scale length for the mock data is short for most abundance bins and 
only reaches hji > 3 kpc for those abundance bins that arc farthest from the center of the metal-poor and a-enhanced 
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Fig. 20. — Results from applying this paper's analysis to a mock data sample composed of a two-component thin-thick disk sample. 
Assuming abundance errors of 0.2 dex in [Fe/H] and 0.15 in [ct/Fe], we deconvolve the observed distribution in ([Fe/H], [ct/Fe]) (FIG. 2) 
into two Gaussian components, which are assumed to correspond to a thin component (h z = 300 pc, fiR = 3.5 kpc) and a thick component 
(h z = 850 pc, h,R = 2 kpc). The panels on the left show the results of fitting single-exponential models in each mono-abundance bin, as in 
FIG. 4. The top panel on the right shows the single-exponential scale height vs. scale length color-coded by [o/Fe]. The bottom right panel 
shows the scale height of the dominant component when fitting a mixture of two cxpontial components vs. the single-exponential scale 
height. In contrast to the fits for the real data sample, this latter plot shows that the mixture-model is preferred and thus we confidently 
infer that in the mock-data sample each bin is made up of two populations (at least for those with single-exponential h z < 800 pc). Other 
differences with the real data are that most mono-abundance bins have a short single-exponential scale length due to the contamination 
from the short, thick disk component and, similarly, that the single-exponential scale heights quickly reach "thick" values of h z > 600 pc 
when going to more metal-poor and [cr/Fe] -enhanced populations. We conclude that abundance errors cannot turn a clearly thin-thick 
separated disk sample into the results we observe for the real data set in FIGs 4 to 6. 



abundance component. Thus, the contamination from the thick-disk component with its short scale length drives the 
inferred scale length for most abundance bins to small values. This behavior is not observed in the real data (FIG. 4). 
The abundance dependence of the single-exponential scale height for the mock data is also much steeper than observed 
in the real data, with values of h z > 600 pc as metal-rich as [Fe/H] = -0.3. 

From these tests we conclude that abundance errors cannot explain the single-exponential components we observe in 
each mono-abundance bin in the real data or the abundance behavior of the scale height and scale length. Based on an 
entirely different argument that uses the observed isothcrmality of the vertical kinematics of the same mono-abundance 
populations, Bovy et al. (2012b) infer that the internal SEGUE abundance uncertainties are likely somewhat smaller 
than the values reported by SEGUE, with likely uncertainties of 0.15 dex in [Fc/H] and 0.07 dex in [a/Fe]. Thus, 



27 



abundance uncertainties do not influence the main conclusions of this paper. 
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